Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task
نویسندگان
چکیده
In this paper we discuss various experimental results using our continuous speech recognition system on the Wall Street Jounal task. Experiments with diierent feature extraction methods, varying amounts and type of training data, and diierent vocabulary sizes are reported.
منابع مشابه
The RWTH large vocabulary continuous speech recognition system
In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract nor...
متن کاملThe Rwth Speech Recognition System and Spoken Document Retrieval
In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract nor...
متن کاملTranscribing broadcast news shows
While significant improvements have been made over the last 5 years in large vocabulary continuous speech recognition of large read-speech corpora such as the ARPA Wall Street Journal-based CSR corpus (WSJ) for American English and the BREF corpus for French, these tasks remain relatively artificial. In this paper we report on our development work in moving from laboratory read speech data to r...
متن کاملIssues in Large Vocabulary, Multilingual Speech Recognition
In this paper we report on our activities in multilingual, speaker-independent,large vocabulary continuous speech recognition. The multilingual aspect of this work is of particular importance in Eu-rope, where each country has its own national language. Our existing recognizer for American English and French, has been ported to British English and German. It has been assessed in the context of ...
متن کاملOn designing pronunciation lexicons for large vocabulary, continuous speech recognition
Creation of pronunciation lexicons for speech recognition is widely acknowledged to be an important, but labor-intensive, aspect of system development. Lexicons are often manually created and make use of knowledge and expertise that is difficult to codify. In this paper we describe our American English lexicon developed primarily for the ARPA WSJ/NAB tasks. The lexicon is phonemically represent...
متن کامل